Abstract. In 1964 Černý conjectured that each n-state synchronizing
automaton possesses a reset word of length at most $(n-1)^2$. On the
other hand, the best known upper bound on the reset length (the minimum
length of reset words) is cubic in n. Thus the main problem here is to
prove quadratic (in n) upper bounds. Since 1964, this problem has been
solved for a few special classes of synchronizing automata. One of these
results is due to Kari [8] for automata with Eulerian digraphs. In this paper
we introduce a new approach to proving quadratic upper bounds and explain
it in terms of Markov chains and Perron-Frobenius theory. Using
this approach we obtain a quadratic upper bound for a generalization of
Eulerian automata.

1 Synchronizing automata and the Černý conjecture

Suppose $\mathscr{A}$ is a complete deterministic finite automaton whose input alphabet
is $\Sigma$ and whose state set is $Q$. The automaton $\mathscr{A}$ is called synchronizing if there
exists a word $w \in \Sigma^*$ whose action resets $\mathscr{A}$, that is, $w$ leaves the automaton in
one particular state no matter at which state in $Q$ it is applied: $q.w = q'.w$ for all
$q, q' \in Q$. Any such word $w$ is called reset (or synchronizing) for the automaton.
The minimum length of reset words is called the reset length and is denoted by
$\mathfrak{C}(\mathscr{A})$.
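
As a small illustration of this definition, the following sketch checks whether a given word is a reset word. The encoding of the transition function as nested dictionaries and the three-state example automaton are assumptions made only for this illustration; they are not taken from the paper.

```python
def apply_word(delta, state, word):
    """Return q.w: the state reached from `state` after reading `word`."""
    for letter in word:
        state = delta[letter][state]
    return state


def is_reset_word(delta, states, word):
    """True if `word` sends every state to one and the same state."""
    return len({apply_word(delta, q, word) for q in states}) == 1


# Hypothetical 3-state automaton over {a, b} (illustration only):
# 'a' is a cyclic shift, 'b' sends state 2 to state 0 and fixes 0 and 1.
states = [0, 1, 2]
delta = {
    "a": {0: 1, 1: 2, 2: 0},
    "b": {0: 0, 1: 1, 2: 0},
}

print(is_reset_word(delta, states, "baab"))  # all states end in state 0
print(is_reset_word(delta, states, "ab"))    # states end in two different states
```

The check costs one pass over the word per state, so verifying a candidate reset word is cheap; the difficulty discussed in this paper lies in bounding the length of the shortest such word.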

Synchronizing automata serve as transparent and natural models of
error-resistant systems in many applications (coding theory, robotics, testing of
reactive systems) and also reveal interesting connections with symbolic dynamics and
other parts of mathematics. For a brief introduction to the theory of synchronizing
automata we refer the reader to the recent survey [14]. Here we discuss one of
the main problems in this theory: proving an upper bound of magnitude $O(n^2)$
for the minimum length of reset words for n-state synchronizing automata.

In 1964 Černý [4] constructed for each $n > 1$ a synchronizing automaton $\mathscr{C}_n$
with n states whose shortest reset word has length $(n-1)^2$, i.e. $\mathfrak{C}(\mathscr{C}_n) = (n-1)^2$.
The automaton $\mathscr{C}_4$ is drawn in Figure 1. Soon after that he conjectured that
those automata represent the worst possible case, thus formulating the following
hypothesis:



Conjecture 1 (Černý) Each synchronizing automaton $\mathscr{A}$ with n states has a
reset word of length at most $(n-1)^2$, i.e. $\mathfrak{C}(\mathscr{A}) \le (n-1)^2$.
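
For small n the extremal behaviour of the Černý series can be observed directly. The sketch below builds $\mathscr{C}_n$ in one common presentation (a cyclic shift plus a letter that moves a single state) and finds the reset length by breadth-first search over subsets of states. The function names and the list encoding of letters are assumptions made for this illustration, and the search is exponential in n, so it is only meant for small examples.

```python
from collections import deque


def cerny_automaton(n):
    """Cerny automaton C_n on states 0..n-1 (one common presentation):
    'a' is the cyclic shift, 'b' maps n-1 to 0 and fixes the other states."""
    a = [(q + 1) % n for q in range(n)]
    b = [0 if q == n - 1 else q for q in range(n)]
    return {"a": a, "b": b}


def reset_length(delta, n):
    """Length of a shortest reset word, or None if there is no reset word.
    Breadth-first search over images of the full state set."""
    start = frozenset(range(n))
    dist = {start: 0}
    queue = deque([start])
    while queue:
        subset = queue.popleft()
        if len(subset) == 1:
            return dist[subset]
        for image in delta.values():
            nxt = frozenset(image[q] for q in subset)
            if nxt not in dist:
                dist[nxt] = dist[subset] + 1
                queue.append(nxt)
    return None


for n in range(2, 7):
    # second and third columns agree for these small values of n
    print(n, reset_length(cerny_automaton(n), n), (n - 1) ** 2)
```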

By now this simple-looking conjecture is arguably the most longstanding open
problem in the combinatorial theory of finite automata. Moreover, the best upper
bound known so far is due to Pin [10] (it is based upon a combinatorial
theorem conjectured by Pin and then proved by Frankl [6]): for each synchronizing
automaton with n states, there exists a reset word of length at most $\frac{n^3-n}{6}$. Since this
bound is cubic and the Černý conjecture claims a quadratic value, it is of certain
importance to prove quadratic (upper) bounds for some classes of synchronizing
automata.





Fig. 1. Automaton $\mathscr{C}_4$ and its underlying graph



2 Exponents of primitive matrices vs reset thresholds 



In the rest of the paper, we assume that $\mathscr{A}$ is a synchronizing n-state automaton
with k-letter input alphabet $\Sigma = \{a_1, a_2, \dots, a_k\}$ and state set $Q$. We
also assume that $n > 1$, $k > 1$ and that $\mathscr{A}$ is strongly connected, because finding reset
words of length $O(n^2)$ can be easily reduced to this case (see [5] for example).
Now let us consider relations between primitive matrices and synchronizing
automata. In order to do this we define a natural linear structure associated
with automata. We regard the states of $\mathscr{A}$ as the numbers 1, 2, ..., n and then assign
to each subset $T \subseteq Q$ its characteristic vector $[T]$ in the linear space $\mathbb{R}^n$ defined
as follows: the i-th entry of $[T]$ is 1 if $i \in T$, otherwise it is equal to 0. As usual,
for any two vectors $g_1, g_2 \in \mathbb{R}^n$ we denote the inner product of these vectors by
$(g_1, g_2)$.

A matrix M is primitive if it is non-negative and its m-th power is positive
for some natural number m. The minimum number m with this property is called
the exponent of the matrix M and is denoted by exp(M). Let us also define
the weak exponent of a matrix M as the minimum number m such that $M^m$ has a
positive row, and denote it by wexp(M). Note that the (weak) exponent depends
only on the set of positions of positive entries $Supp(M) = \{(i,j) \mid M_{ij} > 0\}$
and does not depend on their values. So when we consider exponents of some
matrix M we can assume that it is a 1-0 matrix and that there is a corresponding graph
with the adjacency matrix M. Moreover, $M^t_{ij}$ is equal to the number of directed
paths of length exactly t from state i to j in the corresponding graph. The
following proposition shows the basic properties of primitive matrices.

1 Another cubic upper bound has been proved in [13], but we are aware of an
unclear point in the proof of this result.

Proposition 1. Let M be an $n \times n$ primitive matrix. Then

1. $wexp(M) \le exp(M) \le wexp(M) + n - 1$;

2. $exp(M) \le (n-1)^2 + 1$ and equality holds only for Wielandt matrices;

3. $wexp(M) \le (n-1)^2$.

Proof. The left part of the inequality in item 1 follows immediately from the
definitions. The right part can be easily proved in terms of graph theory. Indeed,
let i be the index of a positive row in $M^{wexp(M)}$. Then there is a path (in the
corresponding graph) of length $d(i,t) \le n - 1$ from the state i to each state t.
Further, for each state s there is a path to some state q of length $n - 1 - d(i,t)$
and a path of length wexp(M) from q to i. Thus there is a path $s \to q \to i \to t$
of length

    $n - 1 - d(i,t) + wexp(M) + d(i,t) = wexp(M) + n - 1$

and the item is proved.

Item 2 has been proven by Wielandt [15]. Item 3 follows from the fact that
$exp(W) = (n-1)^2 + 1$ only for the Wielandt matrix W (item 2) but
$wexp(W) = n^2 - 3n + 3 \le (n-1)^2$.
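
The two exponents are easy to compute for small matrices, which gives a quick check of the values quoted for the Wielandt matrix. The sketch below is only an illustration: the helper names are ours, the Wielandt matrix is taken in one standard form (a directed n-cycle with one extra edge), and entry (i, j) is assumed to mark an edge from i to j.

```python
def bool_mult(A, B):
    """Boolean matrix product: only the support matters for exponents."""
    n = len(A)
    return [[any(A[i][k] and B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def exponents(M):
    """Return (wexp(M), exp(M)) for a primitive 0-1 matrix M.
    wexp: first power with a positive row; exp: first positive power."""
    n = len(M)
    limit = (n - 1) ** 2 + 1          # Wielandt's bound from item 2
    power, wexp = M, None
    for m in range(1, limit + 1):
        if wexp is None and any(all(row) for row in power):
            wexp = m
        if all(all(row) for row in power):
            return wexp, m
        power = bool_mult(power, M)
    raise ValueError("matrix is not primitive")


def wielandt_matrix(n):
    """Adjacency matrix of the cycle 0 -> 1 -> ... -> n-1 -> 0 with one
    extra edge n-1 -> 1 (an assumed standard form of the Wielandt matrix)."""
    W = [[0] * n for _ in range(n)]
    for i in range(n):
        W[i][(i + 1) % n] = 1
    W[n - 1][1] = 1
    return W


for n in range(3, 7):
    print(n, exponents(wielandt_matrix(n)), (n * n - 3 * n + 3, (n - 1) ** 2 + 1))
```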

The proof of the following proposition can be found in [1], but we include
it here to keep the paper self-contained.

Proposition 2. Let $UG(\mathscr{A})$ denote the underlying graph of the automaton $\mathscr{A}$
and let $M = M(UG(\mathscr{A}))$ denote its adjacency matrix. Then

2. M is a primitive matrix;

3. $wexp(M) \le \mathfrak{C}(\mathscr{A})$.

Proof. Item 1 follows immediately from the definitions. Since $\mathscr{A}$ is a synchronizing
automaton there exists a reset word w of length $\mathfrak{C}(\mathscr{A})$ which takes all the states
of the automaton to some state i. This means that the i-th row of $M^{|w|}$ is positive,
so item 3 is proved. Since $\mathscr{A}$ is strongly connected, item 2 is true also.

It follows from the above propositions that the weak exponent of the underlying
graph of $\mathscr{A}$ is at most $(n-1)^2$. This means that there are (unlabeled) paths
of equal length $l \le (n-1)^2$ from every state of the automaton $\mathscr{A}$ into some
particular state. The Černý conjecture asserts additionally that such paths can
be chosen to be labeled by some fixed (reset) word. It seems that this additional
demand should significantly increase the minimum length of such paths. Indeed,
for many synchronizing automata the reset length is much greater than the weak
exponent of the underlying graph. For instance, if a synchronizing automaton
contains a loop then its weak exponent is at most $n - 1$ but its reset length can be
equal to $(n-1)^2$ (for the Černý series). However, in order to prove the Černý
conjecture we only need such a bound in the worst case, and in [1] a strong connection
between the distribution of reset lengths of synchronizing automata and exponents
of primitive graphs is considered.
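
The gap between the two quantities can be seen on the Černý series itself. The following sketch compares, for small n, the weak exponent of the underlying graph (taken here in the sense used above: equal-length unlabeled paths from every state into one state) with the reset length. The encoding of $\mathscr{C}_n$ and all function names are assumptions made for this illustration.

```python
from collections import deque


def cerny_automaton(n):
    """Cerny automaton C_n: 'a' is a cyclic shift, 'b' maps n-1 to 0
    and fixes every other state (so the underlying graph has loops)."""
    return {"a": [(q + 1) % n for q in range(n)],
            "b": [0 if q == n - 1 else q for q in range(n)]}


def weak_exponent(delta, n):
    """Least m such that some state is the endpoint of unlabeled paths
    of length exactly m starting from every state."""
    # reach[s] = set of states reachable from s by paths of length m
    reach = [{s} for s in range(n)]
    for m in range(1, (n - 1) ** 2 + 1):
        reach = [{image[q] for q in r for image in delta.values()}
                 for r in reach]
        if set.intersection(*reach):
            return m
    return None


def reset_length(delta, n):
    """Length of a shortest reset word (breadth-first search over subsets)."""
    start = frozenset(range(n))
    dist = {start: 0}
    queue = deque([start])
    while queue:
        subset = queue.popleft()
        if len(subset) == 1:
            return dist[subset]
        for image in delta.values():
            nxt = frozenset(image[q] for q in subset)
            if nxt not in dist:
                dist[nxt] = dist[subset] + 1
                queue.append(nxt)
    return None


for n in range(3, 7):
    delta = cerny_automaton(n)
    # weak exponent stays linear in n, reset length grows quadratically
    print(n, weak_exponent(delta, n), reset_length(delta, n))
```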

3 Markov chains and an extension method 

The aim of this paper is to obtain upper bounds on reset lengths by utilizing
their connection with exponents of primitive graphs. Let $p \in \mathbb{R}^k_+$ be a probability
vector on $\Sigma$, naturally extended to words by $p(v) = \prod_{i=1}^{|v|} p(v(i))$.
Now consider a random process in which an agent walks in the underlying graph
$G = UG(\mathscr{A})$, following an arrow labeled by $a_i$ with probability $p(a_i)$. Then the
matrix $S(\mathscr{A},p) = \sum_{i=1}^{k} p(a_i)[a_i]$ is the probability matrix of this Markov process.
Let us note that $Supp(S(\mathscr{A},p)) = Supp(M(UG(\mathscr{A})))$ and that $S(\mathscr{A},p)$ is also
column stochastic. To simplify our notation, denote by $1_n$ the vector in $\mathbb{R}^n$ with all
components equal to $\frac{1}{n}$. The following proposition summarizes the properties of Markov
chains that we need.
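
To make the construction concrete, the sketch below assembles $S(\mathscr{A},p)$ for the Černý automaton $\mathscr{C}_4$ with both letters equally likely. The matrix of a letter is taken with a single 1 in each column (column q has its 1 in row q.a), which is the convention under which the sum is column stochastic; this encoding, the choice of p and the function names are assumptions made for the illustration.

```python
from fractions import Fraction


def letter_matrix(image, n):
    """Matrix [a] of a letter: column q has a single 1 in row q.a."""
    A = [[0] * n for _ in range(n)]
    for q in range(n):
        A[image[q]][q] = 1
    return A


def markov_matrix(delta, p, n):
    """S(A, p) = sum over letters a of p(a) * [a], with exact fractions."""
    S = [[Fraction(0)] * n for _ in range(n)]
    for letter, image in delta.items():
        A = letter_matrix(image, n)
        for i in range(n):
            for j in range(n):
                S[i][j] += p[letter] * A[i][j]
    return S


# Cerny automaton C_4 with both letters equally likely (illustrative choice of p).
n = 4
delta = {"a": [1, 2, 3, 0], "b": [0, 1, 2, 0]}
p = {"a": Fraction(1, 2), "b": Fraction(1, 2)}
S = markov_matrix(delta, p, n)

column_sums = [sum(S[i][j] for i in range(n)) for j in range(n)]
row_sums = [sum(row) for row in S]
print(column_sums)  # every column sums to 1
print(row_sums)     # rows need not sum to 1
```

Exact rational arithmetic is used on purpose: the later arguments depend on the denominators of the steady state distribution, which floating point would obscure.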

Proposition 3. Let S be a column stochastic $n \times n$ primitive matrix of some
Markov process. Then

1. $1_n$ is a left eigenvector of S, i.e. $S^t 1_n = 1_n$;

2. there exists a steady state distribution $\alpha = \alpha(S) \in \mathbb{R}^n$ of this Markov process,
i.e. $S\alpha = \alpha$ and $(\alpha, n1_n) = 1$;

3. 1 is the unique eigenvalue of S of maximal modulus and the corresponding eigenspace
is one-dimensional.

Proof. Since S is a column stochastic matrix, $S^t 1_n = 1_n$. Thus 1 is an eigenvalue
of S and the corresponding left eigenvector $[Q]$ is positive. Since S is also primitive,
by the Perron-Frobenius theorem 1 is the unique eigenvalue of S of maximal modulus
and there is also a unique (right) positive eigenvector $\alpha$, i.e. $S\alpha = \alpha$. Note
that $\alpha$ can be chosen to be stochastic. Also by the Perron-Frobenius theorem the left
and right eigenspaces corresponding to the eigenvalue 1 are one-dimensional and equal to
$\langle 1_n \rangle$ and $\langle \alpha \rangle$ respectively.
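
For small matrices the steady state distribution can be computed exactly. The sketch below solves $S\alpha = \alpha$ together with the normalisation condition by Gauss-Jordan elimination over the rationals; it assumes that S is column stochastic and primitive, so that the solution is unique. The two example matrices (the matrix of $\mathscr{C}_4$ with equally likely letters, and a doubly stochastic matrix) are chosen only for illustration.

```python
from fractions import Fraction


def steady_state(S):
    """Solve S x = x, sum(x) = 1 exactly by Gauss-Jordan elimination.
    Assumes S is column stochastic and primitive (unique solution)."""
    n = len(S)
    # Rows 0..n-2: equations of (S - I) x = 0; last row: normalisation.
    rows = [[Fraction(S[i][j]) - (1 if i == j else 0) for j in range(n)]
            + [Fraction(0)] for i in range(n - 1)]
    rows.append([Fraction(1)] * n + [Fraction(1)])
    for col in range(n):
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rows[col] = [v / rows[col][col] for v in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [v - factor * w for v, w in zip(rows[r], rows[col])]
    return [rows[i][n] for i in range(n)]


half = Fraction(1, 2)
# Matrix of the Cerny automaton C_4 with p(a) = p(b) = 1/2 (illustrative).
S_cerny = [[half, 0, 0, 1],
           [half, half, 0, 0],
           [0, half, half, 0],
           [0, 0, half, 0]]
# A doubly stochastic primitive matrix: its steady state is uniform.
S_doubly = [[half, 0, half],
            [half, half, 0],
            [0, half, half]]

print(steady_state(S_cerny))
print(steady_state(S_doubly))
```

In the first example all but one component of the result coincide, which is the situation exploited in Section 4.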

For $K \subseteq Q$ and $v \in \Sigma^*$ we denote by $K.v$ and $K.v^{-1}$ the image and
respectively the preimage of the subset K under the action of the word v, i.e.

    $K.v = \{q.v \mid q \in K\}$ and $K.v^{-1} = \{q \mid q.v \in K\}$.

One can easily check that $[K.v] = [v][K]$, $[K.v^{-1}] = [v]^t[K]$ and $([K], 1_n) = \frac{|K|}{n}$.
In order to simplify our notation we further omit square brackets. Recall that a
word w is reset if and only if $q.w^{-1} = Q$ for some state q, or equivalently $w^t q = [Q]$. Let
P be any positive stochastic vector. Then w is reset if and only if $([q.w^{-1}], P) =
(w^t q, P) = 1$. This follows from the fact that $w^t q$ is a 1-0 vector and P is positive.
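
The reset criterion in terms of preimages can be checked mechanically. In the sketch below the preimage $K.w^{-1}$ is computed directly from the definition and weighted by a positive stochastic vector P; the automaton ($\mathscr{C}_3$), the vector P and the function names are assumptions chosen only for this illustration.

```python
from fractions import Fraction


def preimage(delta, subset, word):
    """K.w^{-1}: all states that the word w sends into the subset K."""
    def run(q):
        for letter in word:
            q = delta[letter][q]
        return q
    n = len(next(iter(delta.values())))
    return {q for q in range(n) if run(q) in subset}


def weight(subset, P):
    """Inner product ([K], P) of a characteristic vector with P."""
    return sum(P[q] for q in subset)


# Cerny automaton C_3 and an arbitrary positive stochastic vector P
# (both chosen only for illustration).
delta = {"a": [1, 2, 0], "b": [0, 1, 0]}
P = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]

print(weight(preimage(delta, {0}, "baab"), P))  # equals 1: "baab" resets to state 0
print(weight(preimage(delta, {0}, "ab"), P))    # below 1: "ab" is not a reset word
```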

Recall that one of the most fruitful methods for finding quadratic upper
bounds on the reset length is the extension method. In this method we choose
some state q and construct a finite sequence of words $w_1, w_2, \dots, w_d$ such that

    $\frac{1}{n} = (q, 1_n) < (w_1^t q, 1_n) < (w_2^t w_1^t q, 1_n) < \dots < (w_d^t \cdots w_2^t w_1^t q, 1_n) = 1.$

It is clear that such a sequence can be constructed for any synchronizing
automaton and that its length d is at most $n - 1$, because each inner product in the sequence
exceeds the previous one by at least $\frac{1}{n}$. Thus a quadratic upper bound will be proved as
soon as one proves that the lengths of the $w_i$ can be bounded by a linear (in n)
function. For instance, if $|w_i| \le n$ for $\mathscr{A}$ then it can be easily shown that the Černý
conjecture holds true for $\mathscr{A}$. Using this fact the Černý conjecture has been
proved for circular [5], Eulerian [8] and one-cluster automata with prime length
cycle [11]. However, it is shown in [2] (see also the journal version [3]) that there
is a series of synchronizing automata where the lengths of the $w_i$ cannot be bounded by
$cn$ for any $1 < c < 2$. This means that for some proper subset $x \subset Q$ the inequality
$(v^t x, 1_n) \le (x, 1_n)$ holds true for each word v of length at most $cn$. Therefore the
Černý conjecture cannot always be proved in this way. This suggests the idea of
finding a stochastic positive vector P such that for each proper subset $x \subset Q$
there exists a word v of length at most n such that $(v^t x, P) > (x, P)$. It turns
out that the vector $\alpha = \alpha(S(\mathscr{A},p))$ (the steady state distribution of the Markov
chain associated with $\mathscr{A}$ and the probability vector p) satisfies this property.
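
The classical extension method (with respect to $1_n$, i.e. counting states) can be sketched as a short program: starting from a single state, it repeatedly searches for a shortest word whose preimage enlarges the current subset. This is only an illustration under our own encoding and naming; the breadth-first search over preimages is exponential in n in the worst case, and the example automaton is $\mathscr{C}_4$.

```python
from collections import deque


def apply_word(delta, q, word):
    """State reached from q after reading the word."""
    for letter in word:
        q = delta[letter][q]
    return q


def shortest_extension(delta, subset, n):
    """Shortest word v with |K.v^{-1}| > |K| together with K.v^{-1},
    or None if no word extends K (breadth-first search over preimages)."""
    start = frozenset(subset)
    seen = {start}
    queue = deque([(start, "")])
    while queue:
        current, word = queue.popleft()
        for letter, image in delta.items():
            # current = K.word^{-1}, so pre = K.(letter + word)^{-1}
            pre = frozenset(q for q in range(n) if image[q] in current)
            if len(pre) > len(start):
                return letter + word, pre
            if pre and pre not in seen:
                seen.add(pre)
                queue.append((pre, letter + word))
    return None


def extension_reset_word(delta, n, q):
    """Build a reset word w_d ... w_2 w_1 that sends every state to q."""
    subset, words = frozenset([q]), []
    while len(subset) < n:
        found = shortest_extension(delta, subset, n)
        if found is None:
            raise ValueError("no extending word exists for this subset")
        word, subset = found
        words.append(word)
    # w_1 was found first but is applied last.
    return "".join(reversed(words)), words


delta = {"a": [1, 2, 3, 0], "b": [0, 1, 2, 0]}   # Cerny automaton C_4
word, pieces = extension_reset_word(delta, 4, 0)
print(word, len(word), pieces)
```

Each piece enlarges the current subset by at least one state, so at most n - 1 pieces are produced; the whole question is how long the individual pieces have to be.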

Theorem 1. Let $x \in \mathbb{R}^n$ be such that $(x, \alpha) = 0$ and let $v \in \Sigma^*$ be a word of minimum
length such that $(v^t x, \alpha) > 0$. Then

1. $\sum_{u \in \Sigma^r} p(u)(u^t x, \alpha) = 0$ for any $r \in \mathbb{N}$;

2. if $|u| < |v|$ then $(u^t x, \alpha) = 0$;

3. $|v| \le \dim(\Sigma^{\le n-1}\alpha) - 1 \le n - 1$.

Proof. Items 1 and 2 immediately follow from $S^r\alpha = \alpha$ for any $r \in \mathbb{N}$. If
$|v| > \dim(\Sigma^{\le n-1}\alpha)$ then by item 2 we have $(u^t x, \alpha) = (x, u\alpha) = 0$ for every u with
$|u| \le \dim(\Sigma^{\le n-1}\alpha)$. For $i \in \{1, 2, \dots, n\}$ define the subspace $U_i = \langle u\alpha \mid |u| \le i - 1 \rangle$.
Then the chain

    $\langle \alpha \rangle = U_1 \le U_2 \le \dots \le U_n = \Sigma^{\le n-1}\alpha$

becomes constant from some $j \le \dim(U_n) < |v|$, i.e.

    $U_1 < U_2 < \dots < U_j = U_{j+1} = \dots = U_n.$

Thus $(x, u\alpha) = 0$ for every u with $|u| \le \dim(U_n) = \dim(U_j)$, whence $(x, g) = 0$ for
each $g \in U_{\dim(U_j)}$. Since j has been chosen minimal, $\dim(U_j) \ge j$ and hence
$U_j = U_{\dim(U_j)}$. So $(x, g) = 0$ for each $g \in U_j = U_{j+1} = \dots$ and $j \le \dim(U_n) < |v|$.
Since $v\alpha \in U_{|v|+1} = U_j$ we get $(x, v\alpha) = 0$, and this contradicts $(v^t x, \alpha) > 0$.
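
The theorem can be tested on a small example. For a subset K, the vector $x = K - (K,\alpha)[Q]$ satisfies $(x,\alpha) = 0$ and $(v^t x, \alpha) = (K.v^{-1}, \alpha) - (K, \alpha)$, so the theorem predicts that some word of length at most n - 1 strictly increases the $\alpha$-weight of the preimage. The sketch below checks this for every proper nonempty subset of $\mathscr{C}_4$; the value of $\alpha$ is the steady state for equally likely letters, and the encoding and names are assumptions made for the illustration.

```python
from fractions import Fraction
from itertools import combinations


def shortest_gain(delta, alpha, subset, n):
    """Length of a shortest word v with (K.v^{-1}, alpha) > (K, alpha),
    searching words of length at most n - 1; None if there is none."""
    def weight(states):
        return sum(alpha[q] for q in states)

    target = weight(subset)
    frontier = {frozenset(subset)}
    for length in range(1, n):
        # preimages of K under all words of exactly this length
        frontier = {frozenset(q for q in range(n) if image[q] in current)
                    for current in frontier for image in delta.values()}
        if any(weight(pre) > target for pre in frontier):
            return length
    return None


n = 4
delta = {"a": [1, 2, 3, 0], "b": [0, 1, 2, 0]}      # Cerny automaton C_4
# Steady state of C_4 for p(a) = p(b) = 1/2; it satisfies S alpha = alpha.
alpha = [Fraction(2, 7)] * 3 + [Fraction(1, 7)]

worst = max(shortest_gain(delta, alpha, K, n)
            for size in range(1, n) for K in combinations(range(n), size))
print(worst)  # Theorem 1 predicts a value of at most n - 1
```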



It is worth mentioning that a similar view of the synchronization process as a
probabilistic process was studied earlier by Jungers [7], but in contrast to the linear
programming techniques of Jungers we use techniques from Perron-Frobenius
theory. Moreover, the main result of [7] is a proposition similar to the above
theorem. But in contrast with Jungers' result we have a fixed vector $\alpha$ here and
thus obtain a quadratic upper bound for a new class of automata in the next
section.



4 Quasi Eulerian Automata 

In view of Theorem 1 the lengths of extension words (for $\alpha = \alpha(S(\mathscr{A},p))$ instead of
$1_n$) are bounded by $n - 1$. Unfortunately, we face here a conjugate problem:
the lengths of such sequences are hard to bound in general, because if $(K_1, \alpha) <
(K_2, \alpha)$ for 1-0 vectors $K_1, K_2$ then the difference $(K_2 - K_1, \alpha)$ can be less than
$\frac{1}{n}$. However, for some classes of synchronizing automata we can directly use this
theorem. First we prove an auxiliary statement.

Corollary 1. Let $\alpha = \alpha(S(\mathscr{A},p)) \in \mathbb{Q}^n$ for some probability vector p on $\Sigma$ and let
$L \in \mathbb{N}$ denote the least common multiple of the denominators of the components of $\alpha$.
Then $\mathfrak{C}(\mathscr{A}) \le 1 + (n-1)(L-2)$.

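Proof. First note that if $x_1, x_2$ are 1-0 vectors and $(x_2, \alpha) > (x_1, \alpha)$ then
$(x_2, \alpha) \ge (x_1, \alpha) + \frac{1}{L}$. Since $\mathscr{A}$ is synchronizing there exist a state q and a letter
a such that $|q.a^{-1}| > 1$. Set $w_1 = a$; then $(w_1^t q, \alpha) \ge (q, \alpha) + \frac{1}{L} \ge \frac{2}{L}$. Suppose
$(w_1^t q, \alpha) < 1$. Let $x_1 = w_1^t q - n(w_1^t q, \alpha)1_n$, so that $(x_1, \alpha) = 0$, and let $w_2$ be a
word of minimum length with $(w_2^t x_1, \alpha) > 0$. Such a word exists because

    $(u^t x_1, \alpha) = ([Q] - n(w_1^t q, \alpha)1_n, \alpha) = 1 - (w_1^t q, \alpha) > 0$

for a reset word u with $u^t w_1^t q = [Q]$. In view of Theorem 1, $|w_2| \le n - 1$ and
$(w_2^t w_1^t q, \alpha) > (n(w_1^t q, \alpha)1_n, \alpha) = (w_1^t q, \alpha)$, hence
$(w_2^t w_1^t q, \alpha) \ge (w_1^t q, \alpha) + \frac{1}{L}$. Continuing in this way we construct a reset word
$w_d w_{d-1} \cdots w_1$ where $|w_1| = 1$ and $|w_i| \le n - 1$ for $i \ge 2$. Since we start from $\frac{2}{L}$ and
each step adds at least $\frac{1}{L}$ to the inner product, $d - 1 \le \frac{1 - 2/L}{1/L} = L - 2$. Thus
$\mathfrak{C}(\mathscr{A}) \le 1 + (n-1)(d-1) \le 1 + (n-1)(L-2)$ and the corollary is proved.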
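
The bound of the corollary is straightforward to evaluate once $\alpha$ is known exactly. The sketch below does this for two illustrative vectors: the steady state of $\mathscr{C}_4$ with equally likely letters, and a uniform vector on four states. The function name is ours, and `math.lcm` with several arguments requires Python 3.9 or later.

```python
from fractions import Fraction
from math import lcm


def corollary_bound(alpha):
    """Return (L, 1 + (n - 1)(L - 2)) for a rational steady state alpha,
    where L is the least common multiple of the denominators."""
    n = len(alpha)
    L = lcm(*(x.denominator for x in alpha))
    return L, 1 + (n - 1) * (L - 2)


alpha_cerny = [Fraction(2, 7)] * 3 + [Fraction(1, 7)]   # C_4, p = (1/2, 1/2)
alpha_uniform = [Fraction(1, 4)] * 4                    # doubly stochastic case

print(corollary_bound(alpha_cerny))
print(corollary_bound(alpha_uniform))
```

For the uniform vector L equals n, which is exactly the situation of the next corollary.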

An automaton is Eulerian if its underlying graph admits an Eulerian directed
path, or equivalently, it is strongly connected and the in-degree of every vertex
is the same as the out-degree (and hence is the alphabet size). It is clear that $\mathscr{A}$
is Eulerian if and only if $S(\mathscr{A}, 1_k)$ is doubly stochastic, where $1_k$ assigns probability
$\frac{1}{k}$ to every letter. Following [12], $\mathscr{A}$ is pseudo-Eulerian
if we can find a probability vector p such that $S(\mathscr{A},p)$ is doubly stochastic.

Corollary 2. If $\mathscr{A}$ is Eulerian or pseudo-Eulerian then
$\mathfrak{C}(\mathscr{A}) \le 1 + (n-1)(n-2)$.

Proof. By assumption we can choose a probability vector p on $\Sigma$ such that $S(\mathscr{A},p)$
is row stochastic. Then $\alpha = \alpha(S(\mathscr{A},p)) = 1_n$ and in view of Corollary 1 we obtain
the desired result.



Note that the same bound for Eulerian automata was proved earlier by
Kari [8] and later generalized to pseudo-Eulerian automata by Steinberg [12]
using other techniques. However, we now show that the techniques suggested in
this paper are more powerful in some sense.

Proposition 4. Let $\alpha = \alpha(S(\mathscr{A},p))$ for some probability vector p on $\Sigma$ and suppose that
for some $c > 0$ there are $n - c$ equal numbers among the components of $\alpha$. Then
$\mathfrak{C}(\mathscr{A}) \le 2^c(n-c+1)(n-1)$.

Proof. Without loss of generality let $\alpha = (r_1, r_2, \dots, r_c, r, r, \dots, r)^t$ and $K \subseteq Q$.
Let $f_i$ indicate whether $K_i = 1$ for $i \in \{1, 2, \dots, c\}$ and let $f_{c+}$ be the number of 1's in
K with index greater than c. Then $(K, \alpha) = \sum_{i=1}^{c} f_i r_i + f_{c+} r$, whence this value
is determined by the vector $f(K) = (f_1, f_2, \dots, f_c, f_{c+})$ where $f_{c+} \in \{0, \dots, n-c\}$
and $f_i \in \{0, 1\}$. Hence there are at most $2^c(n-c+1)$ possible different values of
$(K, \alpha)$ and the length of any extension chain (for $\alpha$) cannot exceed $2^c(n-c+1)$.
In view of Theorem 1 we can choose words of length at most $n - 1$ and thus we
obtain the desired bound.
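
The counting step of this proof can be illustrated by enumerating all subsets for a small vector. In the sketch below $\alpha$ has one exceptional component and n - c = 3 equal ones (the shape of the steady state of $\mathscr{C}_4$ for equally likely letters); the function name and the brute-force enumeration are assumptions made for the illustration.

```python
from fractions import Fraction
from itertools import combinations


def distinct_weights(alpha):
    """All values (K, alpha) over subsets K of the state set."""
    n = len(alpha)
    return {sum(alpha[q] for q in K)
            for size in range(n + 1) for K in combinations(range(n), size)}


n, c = 4, 1
# one exceptional entry followed by n - c equal entries
alpha = [Fraction(1, 7)] + [Fraction(2, 7)] * 3
values = distinct_weights(alpha)

# number of distinct values versus the bound 2^c (n - c + 1)
print(len(values), 2 ** c * (n - c + 1))
```

Although there are $2^n$ subsets, only few distinct weights occur, and this is what keeps the extension chain short.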

As a corollary of this proposition we can prove a quadratic upper bound on
the reset length for a new class of synchronizing automata. We call an automaton $\mathscr{A}$
quasi-Eulerian with respect to $c \in \mathbb{N}$ if there is an Eulerian or pseudo-Eulerian
"component" $E_c$ with an entry state s which contains $n - c$ states, i.e. only the state s
can have incoming arrows from $Q \setminus E_c$, and the rows of $S(\mathscr{A},p)$ which correspond
to vertices from $E_c \setminus \{s\}$ are row stochastic for some p.

Theorem 2. If $\mathscr{A}$ is quasi-Eulerian with respect to $c \in \mathbb{N}$ then
$\mathfrak{C}(\mathscr{A}) \le 2^c(n-c+1)(n-1)$.

Proof. By assumption, for an appropriate probability vector p on $\Sigma$ the rows of
the matrix $S = S(\mathscr{A},p)$ corresponding to states in $E_c \setminus \{s\}$ are stochastic.
In view of Proposition 3, $\alpha$ is the unique positive stochastic solution of the equation
$(S - E)x = 0$, where E is the identity matrix.
It is easy to show that all entries of $\alpha$ which correspond to states from $E_c$
have the same value, whence we can apply Proposition 4 to $\alpha$, c and obtain the
desired result.

As an example of quasi-Eulerian automata we can consider the automata $\mathscr{C}_n$ from the
Černý series. One can easily check that $\mathscr{C}_n$ is quasi-Eulerian for $c = 1$ and thus the upper
bound $\mathfrak{C}(\mathscr{C}_n) \le 2n(n-1)$ follows from Theorem 2. Finally, let us express our
hope that the ideas suggested in this paper could be useful for the general case.